Estimating the Class Posterior Probabilities in Protein Secondary Structure Prediction
نویسندگان
چکیده
Support vector machines, let them be bi-class or multi-class, have proved efficient for protein secondary structure prediction. They can be used either as sequence-to-structure classifier, structure-to-structure classifier, or both. Compared to the classifier most commonly found in the main prediction methods, the multi-layer perceptron, they exhibit one single drawback: their outputs are not class posterior probability estimates. This paper addresses the problem of post-processing the outputs of multi-class support vector machines used as sequence-to-structure classifiers with a structure-to-structure classifier estimating the class posterior probabilities. The aim of this comparative study is to obtain improved performance with respect to both criteria: prediction accuracy and quality of the estimates.
منابع مشابه
Protein Secondary Structure Prediction: a Literature Review with Focus on Machine Learning Approaches
DNA sequence, containing all genetic traits is not a functional entity. Instead, it transfers to protein sequences by transcription and translation processes. This protein sequence takes on a 3D structure later, which is a functional unit and can manage biological interactions using the information encoded in DNA. Every life process one can figure is undertaken by proteins with specific functio...
متن کاملEstimating the Class Posterior Probabilities in Biological Sequence Segmentation
To tackle segmentation problems on biological sequences, we advocate the use of a hybrid architecture combining discriminant and generative models in the framework of a hierarchical approach. Multi-class support vector machines and neural networks provide a set of initial predictions. These predictions are postprocessed by classifiers estimating the class posterior probabilities. The outputs of...
متن کاملProtein Secondary Structure Prediction Using Support Vector Machines and a New Feature Representation
Knowledge of the secondary structure and solvent accessibility of a protein plays a vital role in the prediction of fold, and eventually the tertiary structure of the protein. A challenging issue of predicting protein secondary structure from sequence alone is addressed. Support vector machines (SVM) are employed for the classification and the SVM outputs are converted to posterior probabilitie...
متن کاملIn Silico Prediction of B-Cell and T-Cell Epitopes of Protective Antigen of Bacillus anthracis in Development of Vaccines Against Anthrax
Protective antigen (PA), a subunit of anthrax toxin from Bacillus anthracis, is known as a dominant component in subunit vaccines in protection against anthrax. In order to avoid the side effects of live attenuated and killed organisms, the use of linear neutralizing epitopes of PA is recommended in order to design recombinant vaccines. The present study is aimed at determining the dominant epi...
متن کاملPrediction of Secondary Structure of Citrus Viroids Reported from Southern Iran
Abstract Viroids are smallest, single-stranded, circular, highly structured plant pathogenic RNAs that do not code for any protein. Viroids belong to two families, the Avsunviroidae and the Pospiviroidae. Members of the Pospiviroidae family adopt a rod-like secondary structure. In this study the most stable secondary structures of citrus viroid variants that reported from Fars province wer...
متن کامل